N=1:

rank | frequency | n-gram |
---|---|---|
1 | 9358 | -а |
2 | 6798 | -н |
3 | 5812 | -ы |
4 | 4816 | -р |
5 | 4419 | -е |

N=2:

rank | frequency | n-gram |
---|---|---|
1 | 2586 | -ан |
2 | 2399 | -ар |
3 | 2228 | -ың |
4 | 1882 | -да |
5 | 1639 | -га |

N=3:

rank | frequency | n-gram |
---|---|---|
1 | 2210 | -ның |
2 | 1731 | -лар |
3 | 1216 | -нең |
4 | 1012 | -ләр |
5 | 935 | -нда |

N=4:

rank | frequency | n-gram |
---|---|---|
1 | 698 | -ында |
2 | 694 | -лары |
3 | 540 | -ының |
4 | 511 | -ләре |
5 | 488 | -рның |

N=5:

rank | frequency | n-gram |
---|---|---|
1 | 423 | -арның |
2 | 339 | -ыннан |
3 | 284 | -ларны |
4 | 272 | -ларга |
5 | 247 | -ларын |

The tables show the five most frequent letter N-grams at the ends of words for N=1…5. The procedure parallels 2.2.5 Most frequent word beginnings; here the aim is to detect suffixes rather than prefixes.
For N=3 (for other values of N, replace the 3 in both RIGHT(word, 3) calls):
SELECT @pos := (@pos + 1), xx.*
FROM (SELECT @pos := 0) r,
     (SELECT COUNT(*) AS cnt, CONCAT("-", RIGHT(word, 3))
      FROM words
      WHERE w_id > 100
      GROUP BY RIGHT(word, 3)
      ORDER BY cnt DESC) xx
LIMIT 5;
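The same counting can be sketched outside the database. The following Python snippet is only an illustration, not part of the original pipeline: the function name `top_word_endings` and the in-memory word list are assumptions, and the `w_id>100` filter of the query is not reproduced. As with `RIGHT(word, 3)`, a word shorter than N contributes the whole word as its ending.

```python
from collections import Counter


def top_word_endings(words, n, limit=5):
    """Return (rank, frequency, n-gram) rows for the most frequent
    word-final letter n-grams, mirroring the layout of the tables."""
    # word[-n:] yields the last n letters, or the whole word if it is shorter.
    counts = Counter("-" + word[-n:] for word in words if word)
    return [
        (rank, freq, ngram)
        for rank, (ngram, freq) in enumerate(counts.most_common(limit), start=1)
    ]


if __name__ == "__main__":
    # Hypothetical toy word list; the real input would be the corpus word list.
    sample = ["аларның", "балаларның", "китапларның", "китаплар", "балалар", "ул"]
    for n in range(1, 6):
        print(f"N={n}")
        for row in top_word_endings(sample, n):
            print(" | ".join(str(field) for field in row))
```

Each list entry is counted once, which matches the query's `count(*)` over rows of `words`; a word list with one row per distinct word form therefore gives type counts rather than token counts.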